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j  intkoducxiok 

Computer  techniques  de.3i.gned  to  construct  numerical  taxonomies 
from  laxgc,  complex  arrays  of  data  have  developed  rapidly  in  the  past  few 
years.  It  has  been  the  experience  of  many  who  have  attempted  to  employ 
these  techniques,  however,  that  mathematically  logical  taxonomies  are 
difficult  to  relate  to  the  original  data  from  which  the  taxonomies  were 
drawn.  This  difficulty  lias  all  too  often  left  the  researcher  with  a  neat 
list  of  the  sets  of  cases  which  are  numerically  similar,  but  with  only  the 
vaguest  grasp  of  what  the  grouping  similarities  might  be.  This  report 
introduces  a  simple  computer  technique  to  plot  the  underlying  similarity 
of  groups,  " 

There  are  two  basic  types  of  numerical  taxonomy;  hierarchical,  aiid 
cross-sectional.  The  former  taxonomy  -  the  type  generally  employed  in 
biology  •>  classifies  life  from  a  single  instance  of  a  living  organism, 
through  the  species,  phylum,  kingdom,  tc>  all  of  organic  existence.  The 
hierarchical  taxonomic  scheme  assumes  dynamic  similarity  for  grouping, 

A  cross-sectional  taxonomy,  the  type  most  generally  employed  by  social 
scientists,  assumes  static  similarity,  A  case  is  grouped  as  a  member  of 
one  of  U  groups,  each  case  is  a  member  of  one  and  only  one  group,  and  the 
group  is  assumed  to  be  qualitatively  distinct  from  all  other  groups, 

II.  DEVELOPMENT  OF  THE  GROUPING  PROBLEM 

A  Case  is  characterized  by  the  values  it.  has  for  a  range  of 
variables.  These  values  may  be  specific,  as  the  national  income  of  France 
in  U.  S.  dollars,  or  they  may  be  general,  os  the  freedom  of  group  opposi¬ 
tion  in  France  measured  by  a  two  point  scale.  The  numerical  scores  for 
these  characteristics  of  the  nation.  Franco,  form  what  we  shall  call  an 
attribute  (or  characteristic)  profile  and  can  be  shown  graphically. 
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Figure  1 

ATTRIRDTE  PROFILE  FOR  FRANCE  IK  1953* 

*1-2  S.D. 

+1  S.D. 

MEAN 

4 

-1  S.D, 

-2  S.D. 

ENERGY  INCOME  FREEOP  DESi'.TY  CTHLC 

^Plotted  values  arc  standard  scores  for  France  from  five  standardized 
characteristics  on  eighty-two  nations.  The  plotted  characteristics 
are: 

ENERGY  -  energy  consumption  per  capita 
INCOME  -  national  income 
FREKOP  -  freedom  of  group  opposition 
DEN STY  -  population  density  . 

CATKLC  -  ratio  of  catholics  to  the  population 
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To  illustrat  e  grouping  procedure,  lot  us  assume  Mint  we  are 
interested  In  const  rue tin<  a  tr.xonor.iy  from  eighty- -two  nations  using  as 
our  index  of  similarity  the  five  characteristics  of  Figure  1.  If,  on 
a  transparent  material,  vc;  mere  to  draw  eighty- two  separate  profile 
graphs,  one  for  each  nation,  vc  could  then  build  a  taxonomy  simply  by 
superimposing  sheets  of  the  transparent  material.  Suppose  we  look  the 
graph  for  France  shown  in  Figure  1  and  superimposed  another  graph  from 
one  of  the  eighty-one  remaining  transparent  sheets,  continuing  the  process 
until  we  found  the  line  which  coincided  most  closely  with  that  for  France. 
Let  ur.  assume  that  the  second  graph  is  the  attribute  profile  of  West 
Germany.  Talcing  the  graphs  for  both  France  and  West  Germany,  we  super¬ 
impose  a  third  profile  from  among  the  remaining  eighty  graphs  continu¬ 
ing  until  we  again  find  the  one  most  similar  to  the  two  already  grouped. 

As  vc  continued  this  exercise  we  would  observe  that  the  lines  of  our 
superimposed  profile  group  would  spread  increasingly  over  the  sheet. 

We  would  conclude  that  as  the  number  of  nations  in  the  group  increases, 
the  similarity  of  profile  for  that  group  would  decrease. 

Let  us  assume  that  vc  were  interested  in  building  a  hierarchical 
taxonomy  from  the  transparent  sheets.  We  would  first  lay  the  eighty-two 
profiles  side  by  side  and  then  find  the  two  profiles  out  of  all  combina¬ 
tions  of  two  which  were  most  similar  and  superimpose  them.  This  would 
leave  eighty-one  profiles.  We  would  again  look  for  the  two  profiles  most 
similar  and  again  superimpose  them.  As  we  continued  this  exercise  we 
would  find  that  the  visual  criteria  for  similarity  of  profile  would  have 
to  be  relaxed.  We  would  continue  to  relax  the.  similarity  criteria  until 
all  eighty- two  profiles  were  superimposed.  If  vc  kept  account  of  the 
order  in  which  the  profiles  were,  grouped,  we  could  draw  a  taxonomic  map 
of  our  procedure  similar  to  that  of  Figure  2. 
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For  social  scientists  however,  the  hierarchy  of  profile  groupings 
may  not  be  salient  and  we  might  try  to  build  a  cross-sectional  taxonomy 
from  the  eighty-two  profiles  assuming  static  similarity,  We  could  start 
by  specifying  the  k  number  of  groups  v?e  were  interested  in  or  we  could 
specify  the  level  of  group  coherence  or  similarity  we  were  willing  to 
accept  and  then  see  how  many  groups  resulted.  Either  way,  we  would  try 
the  various  permutations  of  super imposition  until  we  found  the  best 
cross-section  for  our  purposes. 

The  permutations  at  each  step  of  our  hypothetical  grouping 
procedures  would  be  so  numerous  for  the  eighty-two  profile  graphs  that 
they  would  preclude  actually  attempting  to  construct  taxonomies  in  this 
way.  We  must  loo!;  to  computer  techniques  for  assistance. 

III.  APPLICATION 

The  transparent  graph  illustration  presents  grouping  procedure 
as  the  matching  of  characteristic  profiles.  The  available  computer 
techniques  do  not  match  profiles,  hut  instead  reduce  the  profile  stat¬ 
istics  to  single  indices  of  similarity  or  distance  between  the  cases  to 
be  grouped,  and  then  match  the  indices.  The  interpretation  of  computer 
taxonomies  lias  been  difficult  because  the  indices  of  similarity  upon 
which  the  methods  depend  reflect,  but  do  not  reproduce  the  original 
profile  characteristics. 

In  the  transparent  graph  illustration  wc  had  attempted  to  group 
cases  by  their  scores  on  five  uncorrelated  variables.  The  standard 
scales  by  which  the  attribute  profile  values  were  measured  can  be  viewed 
as  Cartesian  coordinates  of  a  five  dimensional  space.  Each  of  the 
coordinate  axis  of  this  space  is  at  right  angler,  to  all  others,  since  the 
variables  are  uncorrelatcd.lt  then  becomes  possible  to  represent  the 
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nation  profiles  ns  single  points  in  the  Cartesian  coordinate  system  of 
the  five  variables.  Each  nation  has  a  unique  location  in  this  space  find 
tills  location  is  a  unique  Euclidean  distance  from  every  other  point  in 
the  space.  The  similarity  (congruence)  of  profiles  is  thus  measured  by 
the  similarity  of  spatial  location  -  the  distance  between  points.  The 
question  of  profile  similarity  way  thus  be  reduced  to  measuring  the 
Euclidean  distances  within  this  space. 

Since  Euclidean  distances  between  cases  measure  similarity,  they 
can  he  used  to  develop  taxonomies.  Our  concern  at  this  point  is  not  the 
development  of  a  taxonomy,  however,  but  with  the  underlying  similarity 
the  taxonomy  represents. 

A  hierarchical  taxonomic  technique  was  employed  to  group  the 
eighty- two  nations  on  their  Euclidean  distances  in  the  space  of  the  five 
characteristics  of  Figure  1.  A  dendrogram  displaying  the  resulting  taxo¬ 
nomy  .i*  shown  in  Figure  2.  The  vertical  lines  represent  the  joining 
together  of  nations  into  n  group,  The  distance  between  nations  in  the 
same  group  is  increased  as  the  procedure  works  from  left  to  right  in  the 
dendrogram.  At  the  extreme  left  no  two  nations  are  grouped  and  at  the 
extreme  right  all  nations  form ,«  single  group. 

As  social  scientists,  we  would  probably  not  be  interested  in  the 
entire  dendrogram  of  Figure  2.  Consequently,  two  interpretive  problems 
arise.  First,  which  groups  should  we  extract  from  the  dendrogram  for 
presentation  as  our  cross-section?  Second,  once  we  have  selected  our 
groups,  how  can  v;e  interpret  the  group  similarities?  To  treat  the  latter 
problem  we  must  return  to  the  initial  data  -  to  the  profiles  themselves. 
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IV.  GEKErAL  DESCKIPTIOW 

Assume  that  vc  arc  interested  in  characterizing  a  group  of  k 

size  given  us  by  the  taxonomic  method.  VJe  calculate  a  group  score  for 

each  of  the  original  characteristics  by  adding  together  the  member  scores 

and  dividing  by  k,  the  number  of  numbers  in  the  group.  This  mean  score 

we  will  call  a  group  profile  score  for  a  characteristic.  Since  we 

normally  find  variation  about  the  group  mean  score  (group  dissimilarity) 

we  can  calculate  the  group  standard  deviation  as  a  measure  of  member 

deviation  from  the  group  moan.  This  variance  will  vary  from  characteristic 

to  characteristic  and  will  serve  as  a  measure  .of  group  cohesion  on  a 

characteristic.  If  we  assume  the  distribution  of  member  scores  around 

•» 

the  mean  profile  score  for  a  group  to  be  normal,  wc  can  add  a  confidence 
interval  to  the  group  mean  score.  A  one  standard  deviation  confidence 
interval  about  the  group  profile  score  would  encompass  approximately  two- 
thirds  of  the  member  scores  for  the  group  across  the  characteristics.  A 
group  profile  with  its  confidence  interval  can  be  shown  graphically. 


(Figure  3).  ^ 

The  horizontal  midpoint  of  the  plot:  in  Figure  3  is  the  population  . 
mean  value  for  the  characteristics.  We  would  expect  mean  profile  scores 
for  any  group  v?e  extract  from  a  population  to  tend  toward  the  horizontal 
midpoint  of  the  plot  since  this  portion  of  the  graph  is  the  most  dense 
portion  of  the  variable  scatter,  If  the  group  mean  profile  score  on  any 
one  of  the  characteristics  is  far  removed  from  the  midpoint,  then  that 
characteristic  distinguishes  the  group:  group  members  are  similar  on  the 
characteristic.  The  tight  cluster  of  group  scores  on  national  income 
(IKCOiiL)  in  Figure  3  is  taken  as  the  distinguishing  characteristic  of  the 
,  plotted  group,  ‘  " 
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ENERGY 

INCOME 

FREEOP 

DEKSTY 

CATHLC 

AVERAGE  SCORE 

1.03 

0.78 

-0.15 

0.64 

-0.20 

STANDARD  DEVIATION 

1.104 

0.136 

1.14  A 

1.097 

0.864 

GROUP  RANGE 

3. A 

0.3 

2.3 

2.7 

2.3 

PROFILE  PLOT 


+2  S.D. 

+1  S.D. 

MEAN 

-1  S.D. 

-2  S.D. 


*Group  was  derived  from  S.C.  Johnson's  hierarchical  Clustering  Scheme, 
Diameter  Method.  Ref.  S.C.  Johnson,  "hierarchical  Clustering  Schemes, " 
Psychoraetr ikn .  vol.  32,  No.  3,  September  1967. 

ENERGY  «  energy  consumption  per  capita;  INCOME  national  income;  . 

FREEOP  «  freedom  of  group  opposition;  DEi.STY  »!  population  density; 

CAThLC  >•  proportion  of  cat'nolics  to  the  population. 


\ 


DO 


V.  l).O.i'\  COMTU'i’LU  PROGRAM  PJ.OFILF.  WRITRUP 


X.  DKSCiVj.PTlOi! 

A.  This  program  calculates  and  plots  group  profiles. 

1).  Input  (data)  up  to  200  cases ,  50  variables. 

C.  Output 

1.  input  Matrix 

2.  means  and  standard  deviations  of  input  variables 

3.  standardized  data  matrix  (option) 

4.  group  profile  statistics,  up  to  50  groups 

5.  group  plots  with  one  standard  deviation  confidence  interval 

(option) 

2.  OKDEK  OF  CONTROL  CARDS 


A.  Plot  Symbol  Card 

•* 

1.  col.  1-  C2  b»^MMW-6~5-4-3~2-ib0bdl*2v3*4+5l6+7i-bi-91011121Dl4].5,l6X7lG 


B.  Main  Control  Card 


C. 


1. 

col. 

1-6 

2. 

col. 

7-9 

3. 

col. 

10-12 

A. 

col. 

13-15 

5. 

col. 

lb .  \ 

• 

col. 

21 

.»  . 

•  ..V 

*  .* 

Group  Size  Card 


PROBLK 

number  of  groups  (maximum  50) 
number  of  cases  (maximum  200) 
number  of  variables  (Maximum.-  50) 

1'. standardize  input  data;  blank  do  not  standardize 
1  plot  group  profiles;  blank  do  not  plot 

(maximum  size  of  group  50  cases) 


1. 

col.  1-3 

number 

of 

cases 

in  first  group 

2. 

col,  4-6 

number 

of 

cases 

in  second  group 

3. 

col.  7-9 

number 

of 

cases 

in  third  group 

26. 

col.  76-78 

•  • 

number 

of 

• 

cases 

•  •  •  • 

in  twenty-sixth  group 

(use  a  second  card  as  needed  to  complete  listing) 
D.  Observation  Number  Card  (one  set  for  each  group) 


1.  col.  1-3  number  of  first;  case  in  the  group 

2.  col,  l\~ 6  number  of  second  case  in  the  group 

3.  col,  7-9  number  of  third  case  in  the  group 

•  ••••••#*•• 

26.  col.  76-70  number  of  twenty-sixth  case  in  the  group 

(use  a  second  card  as  needed  to  complete  listing  for  the  group) 


u 

E.  Variable  Format  Card  (use  2)  both  must  be  included  even  if  second 
is  blank. 

F.  Data  Cards 

G.  Case  Name  Card 

1.  col.  1-6  name  or  number  code  of  variable  (as  many  cards  as 
variables  in  the  input  data  matrix.  Blank  cards 
must  be  inserted  if  variable  names  are  not  unnieci) 

!•  Finish  Card 

1.  col.  1-6  FINISH 
3.  MULTIPLE  JOBS 

A.  For  a  multiple  job,  repeat  card  sets  (A)  through  (li)  for  each  job. 
Card  (I)  signals  the  completion  of  all  jobs  and  is  placed,  at  the 
end  of  the  final  job. 
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6  7 

6(3 

69 

70 

71 
U 

73 

74 

(5 
1 6 

7  7 
76 
JO 

51 
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I  60 


160 
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*( 50 , 501 , SYM( 56) 

00 URL  fc  P KL C l S 1 0 N  OB NA M t ( 2 00  ) , V N AM C { 50 ) , NO B ( 50 ) » F 1 N I SH v PR OB L N 

REAL  LINE  . . . . i  "  . . 

DAT  A  PIN  IS  H/  *  I  IN  1 SK  ’  / 

format (A6 to  13) 

FORMAT  (  201  3,2X1  .  ‘  . . .  "  . . *. . 

FORMAT (20 A4) 

FORMAT (1 IX, 121  Ai) 

FORMAT ( IX*  12, IX, A6, 1X,121A1) 

FORMAI  ( 1  HI  ) 

3,lX,A6,Iir-ll  .3) 

INPUT  MATRIX  A*/)  . . . 'T . 

STANDARDIZED  DATA 
MEAN  AND  SI  AND AMD 


FUKMAT ( 1 X, I 
FORMAT ( 1 2X, 
FURMAl  (12X  , 
FORMAT  (  J.  2  X , 
FORMAT ( 24X , 


F  OR  MAT  (  7  X  ,  1  3 , 2X  ,  A6 , 2X  ,  FI  0 . 3 , 1 4X  ,  F14  .3/  ) 


FORMAT ( 2  X , *C 
FURMAl  (/  1 2X 

format </ix, 

FORMAT (/IX, 

FORMAT  C / 1  X, 

FORMA  1 ( / 1 7  X 
FORMAT (100,1 
FORMAT ( / 1 7X 
FORMAT ( 1H1 , 

1 0  i  CONFIDE! 

FUR MAT (/10X 
FORMAT ( 12X, 

■MX  VARIABLES 
FORMAT (Ac) 

FORMAT  (  1  7  X ,  1HA6.5X)  ) 
FORMAT (2X/2X ) 

FORMA U 1 7 X , A ( A 6 i 7 X j  ) 


MATRIX  R'/I 
DEVIATION  OF 


INPUT  VARIABLES'*//) 


MEAN*,  1 2 X ,  ‘STANDARD  DEVIATION*'//) 


ROUP  PROFILE  VALUES  AND  STATISTICS*) 
•MEMBERS  OF  GROUP* ,13) 

AVI  RAGE*  , 4 X , 3 (F10.4  ,3X  )  ) 

RANGE'  ,6X,  GIF  1C,  4,3  X)  ) 

SI.  DEV.  '  ,3X,  6(F10.4,  3X)/) 

3(13,10X1) 

11  IX,  12,  IX, A6,  IX)  ) 

11(13,8X1) 


PROFILE  PLOT  OF 
CE  INTERVAL*/) 
121  AD 

CHECK  MEAN  AND 

*//) 


PR  OF  00  ft  3 
PROF  01  ID 
PRO  i- Cl  CO 
PROF  Cl  2  0 
PP.E  F  0130 
PR  UFO).  4  0 
PROF Cl  50 
PROF 01 60 
PRCIFOIVO 
PROF  01  60 
PROF  0190 
PR OF 0200 
PROF  02.1.0 
PROF  0220 
PROF 02 30 
PROFC240 
PROF 02  50 
PROF  0260 
PROF  0270 
PRCF0280 
PR  OF  0260 
PROFO30O 
PROF  03 10 
"PjiOF  0320 


FORMAT (///IX THE 


OF 


«'  MARGIN  1 0  (HI: 

*.*/  IX  ,  '  DEV  1/M  ION  IS-, 


IS*  ,12,' 


NUMBER 
ORIGIN 

',  I  3,  * 

DATA  TO  BE  PLOTTED 
FORMA T (  Oil,  1  ?X, *  GROUP  PRODUCT 

i 


GROUP'  ,12, IX,' WITH  ONE  STANDARD  DEV  I A  T  PROF0330 

PROF 03 40 

. .  . . ~ .  PR  Or  0350 

STANDARD  DEVIATION  OF  ST ANOARDI ZED  MAT RPROFC360 

PROF 03 70 

. . . •'  . . . . “PROF  0360 

PROF  0390 
PROF  0400 
PROF  0  4 10 
PR OF 04  ?0 

STANDARD  DEVIATIONS  FROM  THE  PLOT  1  EF  TP  ft  OF  04  '50 
THE  NUMBER  OF  SPACES  PER  S  T  AN0ARDPRi'Jf:044J 


THE 


IS*  , F8.3,* 


*  1 1  X  ,  *  NO .  ‘  ,  IX,  1M7//I 
i*  li  !•«  M  A 1  (  /  6  X  i  1  6 , 4  X  , )  6 1  7  .  3  i 


ORIGIN  IS  LINE*  ,13,*.  THE  RANGE  OF  PROF  c  4  50 
.  T  BE  MIN  1  MUM  VALU  E  J  S * ,  F 7 . 3,  *  .  * )  PROF  0460 
MOMENT  CORRELATION  COEFFICIENTS*  ’////PROF04  70 

PROP 04 00 
PROF-  04  9  ) 


13 


UvLl  1,  MCll  0  MAIN  .DATE  ~  68117  10/24/13 

RL  HU  PLOTTING  SYMBOLS  PROLOG.:. 

.<  L  A  D  (  y ,  *3 C  )  BLANK  ,110  I  ,  X.AS1  ,  (  SYM(  J)  ,  J-“-J  ,86)  PRCJFGf#  - 

hi.  AO  I.\  NUMBER  OF  GROUPS,  NUMBER  OF  OBS  HR  VAT  IONS ,  NUMBER  Ui;  PROF  OSS C 

V  A  .-<  1 A  t  i  L  L  S  ,  A  ML)  OP  1 1  OK  S  10  STANDARDIZE  INPUT,  PLOT,'  AND  CORRELATE  PROFOB  .. 

uO'J  RcAlMS,  10  IP  KOBE  M ,  NGROUP,  NCA  SE  ,  N  VAR  ,  NS  1  AND ,  NP1.0T  ,  NCCiRR  PKOFOi-'-':: 

h  (PI-.OiH  M.  EG.  FINISH)  GOTO  -1909  .  PRDF  00  / !. 

R  LAI)  IN  CROUP  SIZES  . " . . .  ' . . PRO  FOB-.,  j 

RI:AD('.>,  11 .  MNSIZFd),  I=1,N  GROUP)  PR  OF  OB';  0 

READ  in  NUMBER  OF  EACH  OBSERVATION  IN  EACH  GROUP  PROF 0600 

DO  1  J  =  1 ,  NGkOUP  . : . . . .  PKCfOuir 

L-NS  JZC  (  1  J  PROF  06? 

RE  AD (3,  11)  ( NOUS  (  I  ,  J  )  ,  J-l  ,  L  )  PROF  06  ii. 

I-  CONTINUE  .  . . . . “ .  PROFOE  A  ? 

READ  VARIABLE  FORMAT  PROF  Coy; 

KL-AIH  S,  12)  VFMT  PRuEOo’j  ' 

RlAIJ  INPUT  DATA  -  , .  . •:  * . *  .  I'ROEOBV  J: 

UU2  I  -j  ,  NC  A S  f.  P  U  OF  0  6  S 

READ!  i>,  VENT  )  (  A  <  1  *  J  )  ,  Jd  *NVAR)  PR  LIE  OAF  V 

2  CONTINUE  . . . . .  . .  . PROF  07  !• 


03 

0? 


REALM  !>  ,  7  7  )  (  OB  NAME  (  J  J  ,  J-l  ,  NCASt: ) 

REACH  S,  //)<  VNAMtl  J)  ,  J-l.NVAR) 

PRINI  INPUT  DATA 
N Ml  N-D 

1  (  NLA  Si  /  *31  1  +  1 

J  PUD1- Mul)  (NV  AR,  11  )  . 

1EI  JMUIJ.l  y.  0)  J11  (  NVAR/1  l ) 

1  r-  ( JI-UI U.Nt.OI  J“(  N VAR  / 1 1 J  •>  1 

Ul)  402  11=1,1  . . . 

NM1N-NMI  N+  fiO 
DU  4U2  JJ  =  l , J 
K LH:  G  =  (  JJ-1  )  w  H  +  1 
N M 1  N-  M 1  N 0  { N C  A  SE  t  NM 1  N } 

KLNU=M l NO ( ( J J  * 1 1 ) , NV  AR I 
UR1  It  (o,6u) 

WK  IT  (:(  o,  61) 

IF  (11. 1:0.1)  N P.LG=  I 

J  E(  1 1  .1.0.21  NiiCG-Sl 

IF  (II.  CO.  3)  NBE.GM01 

IFdl.EU.A)  NBCG-151 

WR  I  Ttr  <  0,731(1  Nl) ,  1  NU  --K9I  G  ,  KEND I 

r.R  H  L  (  6 , 78  I  (  VNAMfc  (  INI)  I ,  I N  D-K.GLG  ,  KIND  I 

LRJ  T  L  (6  ,80  J 

00  403  KK-NBEG ,NMIN 

krite(oi r>a)  kk,ubma;u-(kk  i,  (atkk.indi,  ind-kge.g , kend i 

CONTI  NUt 

CUNTINUL- 

C  ALGOL  AT  E  KEAN  AND  STANDARD  DEV  I  AT  1 U, 'I  OF  INPUT  DATA  VARIABLES 
00  7  J  ]  ,  NVAR 


PROF-  0  7  i 
P  R  0  F  0  7  2  j 
PROF- C 73 

prop  ova: 
PRO I  07  0 ; 
PROf  0  V 
PROF  07  70 
P RDF 07 TO 
PROF  O/v 3 
pr  j  fob::' 

PROF  GUI j 
PROf  vfi?  :• 
pro fob: > 
prof c a* : 


PROF 

Gi  F 

pro  re  bp 

PROf 

C  8  7 

PKLU 

OB  ' 

prcp 

c ;  9 

prof 

O', . 

PROF 

O';  ; 

PROF 

(•  f  : 

PROF 

0  V  :■ 

PROP 

00-' 

PROF 

C  v 

r'Ror 

09 

Prof 

C9  / 

PROF 

• ;  c-  • 

F  ROE 

C  9  • 

14 


.  I  V  i  t 


d 


li7 

7 


y 

•is 


40  1 


l  ,  Hun  o 


M  A !  N 


DATE  «*  6 fill? 


10/24/13 


I  !  J  1  r:  0  .  C 
10  I  Mi--  O.o 
no  6  1  --  1  i  NC  AS  I; 

Till  -1UH  At  i  .  J) 

1  01  SO  IL'TSO  -i  (  A  (  1  ,  vl  )**2) 

CUN  1  1  NUL 

I  Mu  AN  (  J  J  -  1  0]  /I- 1.0  A  1  (  NCASC  ) 

lf:  (  (1  01  Sf./ FLOAT  (NCASt  )  I”  (  I S4 £•  AN C  J  )**?)  .l.E.0.0)  GO  10  b  7 
STANOt  J)  --MiKTH  TOT  SQ/PLOA  (NCASM  )~  (T  KfcAN't  J  ) **2  }  ) 


Pi<bFl\;oo 

•  PROF- 10 )  o 
PROF  J  0:-.‘0 
"PROF  1C  3*' 
PROF  1040 
prof  lost 
PROF 10 AO 
PROF  1070 
PRilF  10  SO 


CUNTINUl  PROM  000 

CONTINUE  PROF  1J.  00 

PRINT  KHAN  AND  S 1  AND  AFO  DEVIATION  Of  INPUT  DAT  A  VAR  l  ALU.  tS  _  PR0F1H0 

WR  if  ft  6,  00  )  . . 7 . . . . .  . . .  ‘  "  "  7  . "" PROF  1 120 

WKITfcltMC.3)  /PROF  1 130 

WK1  Tt(  (>t  t>4)  PROF  11  4  0 

Wl<  11  U(.»  61) )  (  J ,  VNAML (  J  ) ,  TMl  ANt  J  )  V STANOt  J )  t  J»i  , NVAR)  .  PROF  J  j  SC " 

PROF  1 1 o 0 


1LM  F OK  STANDARDIZING  INPUT  DATA  MATRIX  OPTION 

IMNSI  AND. NT:  .  1  )  GO  TO  99  . ! . 

SI  ANDARDI  7.F  INPUT  VARIABLES 
00  If)  J-ltNVAK 

Oil  9  I-  I,  NCASt  . ""7  . . 

At  I  *  J)  -t  At  I  »  J)  -TMCAN  (  J)  >/SI  ANCt  J  i 
CONTINUE 

PRINT  S  I  AN  U  A  R  D I  7  t  fJ  C  A  T  A  H  A  T  R  IX  II 
Ni  l  I  N  -  0 
I--:  (NCASt :/!))  )  M 
jMOO»MUD(NVAK, 11 ) 

IFt  JNOu.rCl.  0)  J=  (  NVAR  / 1 1 ) 

1  F  (  JKOD  ,Nf.  ,0  )  (  NVAR  /  I  1 )  +  1 

no  4  c  o  i  i-i  ?  i  . . .  '  •*" 

WMIN--NKI  N+S»0 

nu  400  jj  -  ifj 

KflkG  -  (  J  J-- 1  )  *  1 1  +  J.  . .  . . 


NMIN»MI NO t NCASt iNMINI 
KtNLF-  Ml  NO  (  ( J  J  *;  1  1  )  rMVAR  ) 

RR1  1  L  (  o,60) 

l.Kl  T  1:  (  0 1  ID 

IF  (11. 10.1)  NB LG-  1 

I  f  t  I  1  .  !  0  .  )  MB  L  G  “S  I 

IFt  11.10.3)  NblG-iOl 

IT  (I  1.  liO. 4)  NRLG-1  S  1 

WR  1  I  •.  (  6  i  7  3  )  (  I  NU  ,  1  h  0  =  K  S  L  0 ",  K  E  NO ) 


V.  t\  1  i  1  (  u  t  7  ti )  (  VN  A  K 1 1  I K  D  ) »  1 N  D  -  K  U i;  G  ,  K  C  N  0 ) 


vl  R 1  TL  (  0  i  00  ) 

DO  401  KK-NBLG ,NM1 N 

WRITE  Im SO)  KKi  UbNAM.F IKK  )  t t  A ( KK » 

ClJNTlNUl 


JND  )  s  INI)  "KB  LG  »KlNO) 


PROF  11 fti 
’  PROF  1  ICO ' 
PROF  1 1 90 
PROF  1200 
PROF  mV 

Prof  mo 

PROF  1 ?  30 
PROF  }. 2 4 0 
PROF  12  SO 
PROF  IP GO 
"PROF  12  70 
PROF  1  2  CO 
PROF  1290 
PROF) 300 
PROF  1310 
PROF  132  0 
"PRO  FIS  30 
PROF 1340 
PROF  3 3 SO 
PROF  1  3oC 
PROF  1379 
PROF  ’  33 
PROF ) see 
P  R  Cr  J.  4  0  * 
PROF  141  • 
PKOr  14  2  ' 
PROM  43  . 
PROF  1.4  4  0 
PROF  1  4  '■ 
PRfl‘; )  4  . 
PRO I  147 
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i  t  V L  L  1  ,  MUD  0 


MAIN 


OATL  =  68117  10/26/3  3  , 


'.00 

99 

C 

t 


6 

3 


C 


6 


1 


LUNU  NUf: 
CONTINUE 


C AIGUL AT  I:  GROUP  ML  AN  ,  STANDARD'  Dt:V  1  AH  ON,  '  AND  RAN  CL. 
BMAXe- 9999.0  . 


UM I N-9999 .0 

1)0  3  1  -  l.NGRCUP 

DO  4  J  *  l.NVAK 

TOT  20.0 

IUTSU-0.  0 

AM AX =  -99 99.0 


AMI  N ”9 9 09,0 
NS/  =  NS  1/  l.  (  I  ) 

DU  5  K-- 1 t  NS  2 
t.  -  NUGS(1»K) 

ro  I  =•  1U  T  +  A < i.  ,  J) 

TfjrSO«TlTI  so:  (A(!.,  J  )**2) 

II  (/.MAX.  LT.  A(L,J)  )  AMAX  ---  A  (  L  »  J  ) 
l  I*  (  AM  1 N  .  G f  *  A  {  L  ,  .1  )  )  A M  1 N  « ‘  Ail  *  J  > 
Gli.Nl  1  NU  L 


AVCU.J)  «  HJT/FlUAl  (NS1  Zf  d)  )  » 

s i  o i: v c  j,  j  j«o.o 

1  f:  (  (  Tu  T  SO  / 1  •  L  f.I  A 1  (l\‘S  1  ZL  (  1  )  )  )  -  (  AVG  (  I ,  J  )* ’■■■?.)  .If:  ,0*0)  GO  TO  85 
STIJLVJ  I  »  J  )  —  S'JKTJ  (  1UTSO/FLOAT  (MSI  ZCU  )))-(AVG{  1, J)**2  )  ) 

CONI  J N U L  . .  ;  . ■  . . .  . 

]  f:  (  UMAX*  I.T  .  (AVG(  it  J)+STDliV(  I.  J)  ))  BMAX«AVG<  I »  J  )  *STDCV(  I.J) 
IMliMJN.GT.  (  AVG(l  ,  J)  *  SIDE  V(  I  ,  J)  )  )  DflN-AVG  (  1  ,  J)  -ST  DEV  II  ,  J ) 

RAN Gt:  (  1  .  J  )  =  AMAX-AF. !  N  .  .  . .  . . “*  * 

CONI  I NUL 

KAN  G-  U  HA  X+A  .J  5  ( I5M I  N 1 


PR  INI  ■'  GROUP  fit:  All . 
WR  1  Tfc  (  OjfiOl 
Mil  E(  6, 66) 

1)0  405  1=1,  N  GROUP 
WX  I  TL  (  o,67)  I 
NSZ  »  NS  12 (id) 

DU  6  J-l  .MS  Z 
N~NO;)S(  I.J) 
iamt  j)-oaNAMf(w) 


S  TAM  D  A  R  D  0 1  V  )  AH  UN A  U  D  R  AN  GE"  ' 

% 


WR  111(6,72)  (  NOG  S' (1  «  J  )  ,  N ..<[)  (  J  I  ,‘.l=  1  ,  H.S  l) 
J i'i U  D  =•  i  U  J  t )  ( N  V A •< .  8) 


If  (  JMfJD.  LQ.O  )  J  •"  NVAK/U 

1  h  (  JMUD  .  NT.  . «.) )  J  -  (  N  V  A  R/ K  )  +'  J 

DU  40s  JJ  «  I.J 

KOEG-  (  (  J  J-l  )  *6  Hi 

KLiND-'h  IK-'OJ  (  JJ*3)  ,.\VAK) 

tsR  1  TL  (  6 .  7 1)  ( ,  N  ;>  P  G ,  KIN  0  ) 

r.R  il  LI  6  ,  a ).  )  (  V  N  A  K  t.  ( !.')  ,  A  - K  I;  t  G ,  K  PN  D ) 


PROP  146J 
Paul- k  v 
PROF  )  5'  ... 
PRO!" ).  5  ) 
PROF  J. 
PROF  )  53 
PROF  16s 
pr  or- 
PROF  16' 
*"  PROF  i'j/S. 
PRO  FI  !>:■'( 
PROF  1  59 ' 
PRC-rj  6J' 
PROF  16) 
PK OF  in;: 
"  'PROP  16  -55 
PROF  164, 
Pk’.lF  1  65i 
"  PRO  FI  6  6 
PROF  167. 
PROF  16/ 
PR0;  )6.:  • 
PROF  160 
PROF  1 7 C 
PROF  )  70 ; 
PROF  1  7 i C 
PROF  172.' 
PROF).  7/ 
PROF  1  7.0 ; 
PR OF  176 C 
”  PROF  177 
PROF ) 70 
PROF  170 
P ROD  iJ '  . 
PRTJF  lo  J. 

profit;;- 

PRO i-  if.:-/. 
PROP : P4 
PROF  1 5  5 
PROF  )'/.■ 

pro;- it  '< 
Pi' of  )..;•> 
PROF  3 
PRO;  !  o  '• 
PROF  lOi 
PROF  IV l . 

pro;  )«; 

PROF  1V-. 


H> 


G  i  t.  Vl  t  1 ,  GOD  0 


MAJN 


OATS;  68117 


10/24  / 1  3 


S.flEV.  CONFIDENCE 


wR  IUIu,6il)  (  AV  G  H  ,  N)  ,  N-KBI:  G  i  K  f;  i\  (; ) 

WR  1  It  (  is ,  7  0 )  (  SI  Of:  Vll.Mr  N~  KG  EC  ,  Kt:  NO) 
r.RIH  (6,69)  (  RANGE  (  1 ,  N  )  vN»KGEGf  KIND  ) 

■Hi'.  CUM  INCH 
K-  0 

DO  406  1  I  ~  1 ,  1  2 

K  - M  4  . . . 

lFU.Nt.K)  GO  TCI  406 
Vi  l<  1  1  L  ( b  f  6  0  ) 

•  4 C 6  CON  1 J  NOE 
4  09  CON11NUI: 

C 

1 1-  (  NPL.Cn  .  Nr.  .11  GO  TO  977 
C  Pi  01  GOUUP  MEAN  ACROSS  VARIABLES  WITH  ONE 

C  INTERVAL  P.UUNl)  J  KG  1  HE  PLOT  LOCATIONS 

KSP-  100.  C /KANG 
LA  =  AlsS  UMlNK  1 .0 
KOR  -  (  KSP'v  LA  )  <1 

U  K  1  T  L  (  6 ,  106)  LA,KSi’ ,  KGR  ,  RANG  ,8MTn“" 

DO  995  1  =  1,  MG ROUP 
DU  1C  li  J  “  1,121 
10 r>  I.  IN  E  (  J  )  -  ill.  ANK 
N-2  0-  (2  *  LA  ) 

DO  91  11 =1,121 ,KSP 

M»N-i  . . 

JK I+i 

LINK  I  I  )  -  S  Y  '  T  { !-i) 

L  INK  JJ--SYMI  N ) 

51.  N  =  N+2 

1 F  <  LA  .  (j  T  .  V  )  LI  ME  (  KOR)  «  SYM  ( 20 1 
WRIT  l:  (6  i  74  )  I 
NS/.-NS1/K  I) 

DO  52  J“  1  »N  S/. 

N-NU8S  (  1 1  J  ) 

52  NOb  (  J )  ~ URN  AM E  ( N ) 

KK  )  I  K  6 ,  7 2  )  (  NGI'j SU  , J  >  , K Oil (  J)  » J-  1  »NS2) 

M'<I  TC  16 ,75  )  LINE .  "  .  . . . 

00  111  vl "  1  »  1 2  J 
101  LlNfc(J)*  1)01 

r.R  I  Iti(  o',  3  1  )  LINE  . . .  . .  ~  '  " 

•  DO  10  3  vl  -  J  »  1. 2  1 
10  J  11NKJ)--  CLANK 

DO  1 9 U  N\  “1,121,  KS  I*  '  .  . •  r'-‘" 

.14  1  t 1 Nl ( NN  > -Du  1 

D'J  30  K-  lvNVAk 

R«(  (Avon  ,  K)  *  H  CAT  (  KSP  )  )  +  0 . 5  HFIOAT  {KOR ) 

IT  o’  -  (  I  (  AVGf  )  ,  K  )  +  SI  i.'I:  V  (  1  ,K)  )  *r  LOATC  NSP)  )'+0.  5) +f  LUATC  KUR) 

•  MM  a  (  {  l  AV  G  (  J  ,  K  )  -Si  DEV  I  1 ,  K  )  )  <  FL  DAT  (KSP)  I+0.5J  +  FLOA  1  I  KOR  ) 


PKClr'  J.  9 
PROF  ).  •  . 

p  nor  ].  9  v 
pro f  i  . 

» KOFI 4  - 

prof?:: 

PROF  20 : 
PR0F2C : 
PROF  2  .V 
P  RO!-  i  s.  ■ 
PROF  ?.C  f 
PilCF  7  0  >• 
PROF  27  • 
PROF' 2  a: 
PROF  2<.  . 
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To  characterize  a  cross-sectional  taxonomy  from  the  dendrogram 
of  Figure  2,  we  selected  a  cross-section  of  eight  groups  end  plotted 
their  profiles  with  the  profile  program.  The  program  features  an 
automatic  scaling  device  which  scales  the  plots  to  the  range  of  the 
statistics  to  he  plotted  in  each  job.  The  United  States,  the  sole  mci-ber 
of  the  eighth  group,  proved  to  be  an  extreme  outlier  on  national 
income  and  energy  consumption.  To  demonstrate  the  variable  scale 
feature,  groups  1  and  2  were  plotted  by  themselves,  without  the 
necessity  to  scale  the  plots  for  the  United  States.  The  Plots  for 
Groups  3  and  6  are  products  of  the  computer  run  when  the  United  States 
was  plotted.  The  computer  output  for  these  plots  and  statistics  for 
the  first  three  groups  are  shown  on  the  following  pages. 

To  allow  for  a  large  number  of  variables,  plots  are  made 
vertically  down  the  page,  beginning  with  variable  1  at  the  top.  The 
group  mean  score  is  denoted  by  an  "X”  and  M*M  denotes  the  one  standard 
deviation  confidence  interval  about  the  mean  score.  If  a  single 
asterisk  appears,  ns  in  the  group  1  plot  for  INCOME,  member  standard 
deviation  is  negligible  and  the  confidence  interval,  converges  on  a 
single  point.  If  the  asterisk  is  very  close  to  the  other  asterisk, 
there  may  he  no  "X"  between  them.  It  is  assumed  that  the  mean  score 
would  fall  between  them  in  this  case.  The  user  may  wish  to  shade  the 
confidence  intervals  in  the  output  ns  an  aid  in  interpreting  the 
profile  variation  across  characteristics  for  the  gr0vip. 
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